Logistic Regression and Bayesian Model Selection in Estimation of Probability of Success
نویسنده
چکیده
Logistic regression and linear discriminant analysis are used to estimate probability of success for binary data based on a training sample and a certain amount of prior information. Posterior probabilities of success are calculated for different choices of the model for the training sample distribution. An approach to model selection is suggested for certain classes of distributions based on the exponential family. As an illustration, we consider two examples. One is an artificially simulated dataset from a mixture of Poisson distributions. Another deals with the history of communication between a sales agent and a customer used to estimate the probability of successful sale. The dataset was obtained from a Midwestern company that uses telesales in a business-tobusiness environment. The number of decision-maker contacts X is analyzed as an explanatory variable. A comparison is made between logistic regression technique and discriminant analysis based on negative binomial or zero-modified Poisson distribution of the number of contacts. There seems to be a good agreement between the results obtained by both methods. Discriminant analysis as demonstrated by Efron (1975) can be more efficient if some model assumptions are made regarding the distribution of X . However, logistic regression may be used in order to validate the choice for a certain class of models. A. Introduction and Methods A
منابع مشابه
Hyperbolic Cosine Log-Logistic Distribution and Estimation of Its Parameters by Using Maximum Likelihood Bayesian and Bootstrap Methods
In this paper, a new probability distribution, based on the family of hyperbolic cosine distributions is proposed and its various statistical and reliability characteristics are investigated. The new category of HCF distributions is obtained by combining a baseline F distribution with the hyperbolic cosine function. Based on the base log-logistics distribution, we introduce a new di...
متن کاملBayesian and Iterative Maximum Likelihood Estimation of the Coefficients in Logistic Regression Analysis with Linked Data
This paper considers logistic regression analysis with linked data. It is shown that, in logistic regression analysis with linked data, a finite mixture of Bernoulli distributions can be used for modeling the response variables. We proposed an iterative maximum likelihood estimator for the regression coefficients that takes the matching probabilities into account. Next, the Bayesian counterpart...
متن کاملSample size determination for logistic regression
The problem of sample size estimation is important in medical applications, especially in cases of expensive measurements of immune biomarkers. This paper describes the problem of logistic regression analysis with the sample size determination algorithms, namely the methods of univariate statistics, logistics regression, cross-validation and Bayesian inference. The authors, treating the regr...
متن کاملLogistic Regression Analysis of Some Factors Influencing Incidence of Retained Placenta in a Holstein Dairy Herd
To investigate the effects of certain factors on the rate of retained placenta, 2844 calving records from 1288 Holstein cows in a herd were used. These cows calved during year period of 2001 to 2007. A generalized statistical linear model was applied to analyze the data. Logistic regression model was applied as the statistical model. In the model, fixed effects of year, season (warm or cold) an...
متن کاملBayesian change point estimation in Poisson-based control charts
Precise identification of the time when a process has changed enables process engineers to search for a potential special cause more effectively. In this paper, we develop change point estimation methods for a Poisson process in a Bayesian framework. We apply Bayesian hierarchical models to formulate the change point where there exists a step < /div> change, a linear trend and a known multip...
متن کامل